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Introduction 

What is this paper about? 



Data conversion programs 



Programs that: 

O Determine the shape of data 

O Traverse data 

O Package data 

O Pretty print 

They have in common that each of them has an inverse 



Conversion programs are expressed using John Hughes' arrows. 

The laws needed to prove the correctness of conversion can be seen as 

restrictions on the possible implementations. 

□ a - = I -0Q.0 
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Introduction 

What is this paper about? 



Goals 



• Construct a number of polytypic programs for data conversion 
problems, with their inverses 

» Show how to construct and calculate with polytypic functions 



Connections with the course 



• Provides programs that work on generic datatypes 

• Served as a base for Generic Haskell 



Polytypic Data Conversion Progran 



& -OQ.O 



October 23, 2012 



Definition! 

if Contents 



Q Definitions 
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Polytypic function 



A polytypic function is a function parametrized on type constructors. Can 
either be defined by induction on the structure of user-defined datatypes or 
by other functions. 



Forward composition 



f;g = gf 



& -OQ.O 
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Definitions 

PolyP, a haskell 



In this paper most code is created in PolyP 



PolyP 



PolyP extends Haskell with a construct for writing polytypic functions. 

• Looks a lot like generic libraries 

• Data conversion functions are polytypic functions in PolyP 

• Has functions in and out which work like the from and to (as seen in 
the course) 



Polytypic Data Conversion Progran 



October 23, 2012 



Definitions 

PolyP, a haskell 



PolyP pattern functors grammar: 

f, g, h ::= g + h | g * h | Empty | Par | Rec | d 0 g | Const t 



For instance we can write a Rose in PolyP grammar in this way: 

data Rose a = Node a (List (Rose a )) 
^Rose = Par * (List @ Rec) 



J 
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Map functions 



mapd :: Regular d =>• (a — > b) — >■ (d a — >■ c/ b) 
mapdP = outrf ; map2d p (mapd p) ; irid 
map2 f ::{a^c)^{b^d)^(fab^fcd) 



polytypic map2 :: (a — > c) — > (b — > d) — > (f a b — > f c d) 
= Ap p' — > case f of 

g + h map2 ff pp' + map2/, p p' 
g- * /? — >■ map2 g p p' * map2f 1 p p' 
Empty — > id 
Par — > p 
Rec p' 

d @ g -f map d {map2 g p p') 
Const t — > id 

□ a - = I ^)o.o 
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Polytypic programming 

if Contents 



Q Polytypic programming 

• Shape and contents 

a Additional definitions: Arrows 

• Traversal 
a Packaging 

a Show and read 
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Polytypic programming Shape and contents 

ntents 



Functions are shown for 

• Separating a datatype value into its shape and its contents 

• Combining shape and contents to a datatype value 

separate :: Regular d =>■ d a — > ( d(), [a] ) 
seperate x = (shape x, flatten x) 
shape :: Regular d =>• d a — > d () 
shape = map (const ()) 
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Polytypic programming Shape and contents 

ntents 



Writing combine would be slightly harder, we'll have to traverse the shape 
and insert elements from a list. 



Problem 



How do we prove combine is the inverse of separate? 



Solution: arrows! 



Polytypic programming Shape and contents 



contents 

;cue 



Fuse flatten and shape together into an arrow map that keeps track of 
state and does one single traversal. To avoid altering all types with state 
information a new type is created, constructor SA, for functions that 
side-effect on a state. 

data SA s a b = SA ((a, s) ->■ (b,s)) 
Notation for this in the paper is a ~» s b. 
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Polytypic programming Shape and contents 

Shape and contents 

Redefining separate 



Now we can use this type and an arrow map (called mapAr) to redefine 
separate. 

separate :: d a ~^>\ a ] d() 
seperate = mapAr put 
put :: a ~> [a] () 

put = SA (A (a, as) -> ((), a : as)) 
mapAr :: (a ~~* s b) — > (d a ~~* s d b) 

The r in mapAr denotes the direction of the traversal, for instance, a given 
tree would be traveled right first then the left part. 



□ 9 



Polytypic program 


ming Shape and contents 


Shape and contents 




Creating combine 





Using the left to right traversing variant of the arrow map, mapAl, we can 
write combine. 



combine :: d () ~-»r a i d a 
combine = mapAl get 

get :: () ~» [a] a 

get = SA (A ((), a : as) -> (a, as)) 
mapAl :: (a ~~> s b) — > (d a ~» s d b) 



Now with both separate and combine as arrow maps we can show that 
they're eachothers inverses 



Polytypic Data Conversion Progran 



October 23, 2012 15 / 40 



Polytypic programming Shape and contents 

Shape and contents 



Because we can't use function composition on our SA we create a new 
composition operator 

:: (a b) -)• (b c) ->• (a ~~» s c) 
SA f ;gg> SA g = SA (f ; g) " I 

Now we can proof separate and combine are inverses by proving put and 
get are inverses (recall definitions of separate and combine) 

put get 

= {Definitions of get and pur} 

SA (A (a, as) -> ((), a : as)) ^> SA (A ((), a : as) -> (a, as)) 
= {Definition of (^>) } 

SA (A (a, as) -> ((), a : as)) ; SA (A ((), a : as) -> (a, as)) 
= {Simplification} 
SA id 

i 
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Polytypic programming Additional definite 

dditional definitions: Arrows 



Using arrows to generalize monads gives a wider applicability. 

We create a function that lifts function to the level where they can take 
and return states. 

arr :: (a — > b) — > (a ~» s b) 
arrf = SA (f^id) 

A different way to write this is ~~t instead of arr f. 
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Polytypic programming Additional 

Additional definitions: Arrows 

Arrows between pairs 



For pairs a function first is defined which applies an arrow to the first 
component and leaves the second unchanged. 

first :: (a ~~» s b) ->■ ((a, c) ~~» s (b, c)) 

first (SA f) = SA (A((a, c), s) -> let (b, s') = f (a, s) in ((b, c), s')) 



A function second does the same but then for the second component. 



second :: (a ~» s b) — > ((c, a) (c, b)) 
second f = swa}) ^> first f ^> swa}) 



J 



Polytypic Data Conversion Programs October 23, 2012 



Polytypic programming Additional definitions: Arrows 

Additional definitions: Arrows 



• Type SA s a b encapsulates function from a to b that manipulate 
state s 

• But what about functions that don't refer to the state? 



class Arrow (~») where 
arr :: (a — > b) — > (a b) 

(;§>) :: (a ~> b) ->■ (b ~» c) ->■ (a ~» c) 
first :: (a -w b) -> ((a, c) ~> (b,c)) 



Arrows imply a number of laws (for sake of time left out here). 
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Polytypic programming Additional 

Additional definitions: Arrows 

Choice operator for arrows 



Subclass ArrowChoice for choice operators. 

class Arrow (~») => ArrowChoice (~») where 
(+H-) (a ~» c) — >■ (b ~» d) — >■ (Either a b ^ Either c d) 
(HI) :: (a ~» c) -4 (b ~> d) ->■ (Either a b ~» c) 



Combining the classes Arrow and ArrowChoice we can create instances for 
SA s and with those create easier proof for our conversion programs. 
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>tc programming 



Traversa] 



How to traverse a datatype? We've seen mapAr and mapAl before. 

mapAr :: Arrow/Choice =>■ (a ~~» fa) — > (c/ a c/ fa) 
mapAl :: /4rroi/i/C/?o/'ce (-w) =4> (a fa) — > (c/ a ~» c/ fa) 

Their definitions of their arrow maps: 

mapAr d p = out^ T/? d p (mapAr d p) ^> In^ 
map A^ u = outd ;g> TL d u (mapAld u) ;g> /'ajj 

TR and TL are the traversal functions. 
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I raversa 

Traversal functio 



Their definitions: 

polytypic TRf :: (a c) — » (b ~» d) —> (f a b ~~> f c d) 
= Ap p' — >■ case f of 
g + /) TR g p p' +++ TR h p p' 
g * h — >■ TR g p p' <* TRh p p' 
Empty — > id 
Par — > p 
Rec p' 

d @ g ->■ mapAr d (TR g p p') 

Const t — >■ /d 

TL is quite simular but uses *> in the product case to traverse from left 
to right and some recursive calls are changed. 

□ a - = I -0Q.0 
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ling Packaging 



The next program the paper discusses is a packaging program, lets first 
define this: 



Packaging program 



Given a datatype value (an abstract syntax tree), construct a compact (bit 
stream) representation of the abstract syntax tree. 

An example is the binary tree treeShape: 

treeShape :: Tree () 

treeShape = Bin( Bin( Leaf() ) ( Bin( Leaf() ) ( Leaf() ))) ( Leaf() ) 

Which can be converted to the following bit stream: 

Bin (Bin (Leaf ()) (Bin (Leaf ()) (Leaf ()))) (Leaf ()) 
110 10 0 0 
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>tc programming Packaging 



Creating a packaging program we need to do three things 

• Define function pack, which takes an element level packer to a 
datatype level packer 

pack :: (a ~» ()) — > (d a ~» ()) 

• Define function unpack, which takes an unpacker on the element level 
and takes it to an unpacker on the datatype level 

unpack :: (() ~~* a) — > (() ~~* d a) 

• Proof that if p and u are inverses on element a, then pack p and 
unpack p are inverses on the datatype d a 
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Functions pack and unpack are defined by recursion on top level by PT 
and UT, packing and unpacking functions on top level. 



packj p = Pj noOfConSd p (packj p) out 
unpack^ u = Uj noOfConsj u (unpack^ u) 3g> in 

The arrow pack p uses printer p for printing arguments. 

The function noOfCons is just an integer which is the number of 

constructors possible. 
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Pol 

Packaging 

Actual packing and unpackir 



P T :: Int (a ~» ()) -»• (b ~» ()) -)■ (f a 6 ()) 
Pj n p p' = packCon n <gC Ps p p' 



P5 packs a value with p for parameters and p' for recursive structures. 
Arrow packCon converts constructors to numbers. 

U T ■■■ Int -> (() a) -> (() ~» 6) -> (() r" a 6) 
Uj n u u' = unpackCon n ^> Us u u' 

Arrow unpackCon takes a constructor number and converts it to a 
constructor. Us uses parsers u to fill parameters and u'for the recursive 
slots. 



& -00.0 
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Packagi 

Ps and Us 




The last part is defining Ps and Us, very similar to mapAr and mapAl 
since they traverse the data structure. 

• Difference between Ps and Us is (like mapAr and mapAl the traversal 
direction 

• Inverse proof is very similar 

• Differ in the way sums are handled 
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Polytypic programming Show and read 

Show and read 

And more arrows 



How to define polytypic versions of show and read? 

• New class ArrowReadShow with four classes of operations: 

O ArrowZero (error handling) 

Q ArrowPlus (error handling) 

O ArrowSymbol (print/parse symbols) 

Q ArrowPrec (operator precedences) 
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Polytypic programming Show and read 



Show and read 

Failing arrows 



Data conversion with possible failure. ArrowZero is defined for pure failure 

class Arrow =>• ArrowZero (~») where 
zeroA :: a ~~> b 



ArrowPlus has operators which take alternatives if one argument fails. 

class ArrowZero (y*) =>• ArrowPlus (~*) where 
(< | >) :: (a b) — >• (a ~~* c) — > (a ~~» Either b c) 
(<+>) :: (a -w b) (a -w b) (a -w b) 

(< | >) makes choice depending on hidden state or depending on 
summand in output. 

(<+>) uses second arrow argument if first fails. 
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Polytypic programming Show ani 

Show and read 

Reading and writing symbols 




Class for printing and parsing symbols: 

class Arrow (~») =>• ArrowSymbol (~») where 
readSym :: Symbol — > (a ~~> a) 
showSym :: Symbol — >• (a a) 

type Symbol = String 

Class that handles precedence (priority) levels. 

class ArrowSymbol (~») =>■ ArrowPrec (~») where 
setPrec :: Prec — )■ (a b) — >• (a ~* b) 
readPrec :: Prec — > (a ~» b) — >• (a ~~> b) 
showPrec :: Prec — >■ (a ~~* b) — > (a ~~* b) 
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Polytypic programming Show and read 

Show and read 



The polytypic functions show and read defined in the paper use the 
operators from the previous declared classes. 

class (ArrowChoice (~»), ArrowPlus (~»), ArrowPrec (~>)) => 
ArrowShowRead (~>) 



We create the conversion program by using four levels: 
O Top level recursion: show and read 
Q Breaking down sum structure: Ss and Rs 
O Breaking down product structure: Sp and Rp 
Q Dealing with parameters and other datatypes: Sr and Rr 

□ g - = | 
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Polytypic programming Show and read 

Show and read 



The four levels will in short do: 
O Expose top level and handle recursion 
Q Handle parentheses 

Q Insert spaces between arguments of constructors 
O Apply show (or read) functions on parameters 



Polytypic programming Show and read 

Show and read 

The code 



showd s = Ss,j constructors^ s (showd s) <^C ouid 
readd r = Rsj constructors^ r (readd r) 3§> /Vty 



Ssf :: [Constructor] ->• (a ()) — > (b ()) — > (f a b ~» ()) 
Rsf :: [Constructor] -4 (() ~~» a) -> (() 6) — > (() f a b) 

Where [Constructor] is the list of constructors of datatype d a. 

polytypic Ssf :: [Constructor] — >■ (a -w ()) — >• (fa ~» ()) — >■ (f a b ()) 
= A(c : cs) ss'-> case f of 
£ + /?—>■ Ssf [c] s s' HI Ss/, cs s s' 

g — )■ showParen(prec c)(showSym(name c) Sp^ s s') 



J 
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Show and read 

Products 



Polytypic programming Show and read 



Product case for show: 

polytypic Spf :: (a ~~* ()) — > (b ~* ()) — >■ (f a b ~» ()) 
= As s' — > c ase f of 

g*h -> A() -»■ (( ),()) « (Sp g s s' <* Sp ft s s') 
Empty -> A() -»■ () 

g- — >■ s/?oi/i/Sym " " <gc setPrec high (Sr g s s') 
Rest case for show: 

polytypic Sr f :: (a ()) — > (b ~~* ()) — )• (f a 6 ~~> ()) 
= As s' — > case f of 

Par -> s 
A?ec -> s' 

d @ g ^ show c j{Sr g s s') 
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Results 



Q Results 
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es 



We've seen four data conversion programs being defined: 



Shape and contents 



separate :: d a d () 
combine :: d () ~~*r a i d a 

separate ^> combine = id 



Arrow maps (traversal) 



mapAr :: ArrowChoice =^ (a b) — > (d a d b) 
mapAl :: ArrowChoice (~») => (a b) — > (d a ~* d b) 
p ^> u = / =>• mapAr p 3^ mapAl u = map / 



□ a 
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es 



Packing 



pack :: ArrowPack =>- (a ~~» ()) — > (d a ~~* ()) 
unpack :: ArrowPack (~») => (() ~» a) —>•(() d a) 
p»u = / =>■ pack p ;g> unpack u = map i 



Pretty printing 1 


show :: ArrowShowRead (~>) => 


(a 


-())- 


+ (d a ~» ()) 




read :: ArrowShowRead (-**) =>• 
— >■ 

s»r= / =>■ show s read 


(0 


~-> a) — 

H 

map / 


► (() d a) 




r = 







These last two can be used together to create compression and 
decompression. 
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le of Contents 



Q Conclusion 
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• Since inverse functions are implemented inside the construction of the 
programs, the size and complexity of the code are reduced compared 
to other work (example: polytypic read and show by Bjork and 
Huisman). 

• Time and space efficiency not optimal, might be improved in future 

• Using arrows creating the programs simplified the construction and 
form, made it easier to create inverses and proof their correctness. 

• The programs shown in the paper serve as a base for other programs 
that solve their data conversion problems. 

• For instance, Generic Haskell is based upon this 
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• Has a nice introduction 

9 Long read because of the new definitions (arrows, etc) 

• Interesting concepts 
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